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Abstract — This paper describe different techniques we can 
use for multiple sequence alignment, we have analyze 
different techniques we can use and find which one is better 
.We have taken several example as parameter. There is 
different approachthat we can apply in various conditions. 
This paper is concerned with the efficient execution 
Multiple sequence alignment methods in multiple client 
environments Multiple order alignment (MSA) is in a 
computational form Expensive method, which is commonly 
used in Large databases of computational and molecular 
biology Protein and gene sequences are available for 
scientists Community frequent, these databases are 
accessed to execute multiple user MSA queries Data is in 
server. 
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I. INTRODUCTION 

A multiple sequence alignment (MSA) of three or more 
biological sequences, generally protein, DNA, RNA, or a 
view of the alignment. In many cases, the input set of query 
sequences is evolutionary relationships by which they share 
a lineage and are descended from a common ancestor are 
assumed. From the resulting MSA, sequence homology can 
be inferred and phylogenetic analysis sequences shared 
evolutionary origins can be conducted to assess. Correct 
alignment as image depicting scenes of mutation events 
such as point mutations (single amino acid or nucleotide 
changes) that appear as different characters in a single 
alignment column, and insertion or deletion mutations 
illustrate ( indels or gaps) that appear as hyphens in one or 
more of the sequences in the alignment. Multiple sequence 
alignment often protein domains, tertiary and secondary 
structures, and even individual amino acids or nucleotides 
in the sequence is used to access protection. 

Multiple sequence alignment process of aligning such a 
sequence set shows. MSAs require more sophisticated way 
than pairwise alignment because they are more 
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computationally complex. Most multiple sequence 
alignment programs, more than a few shots of medium 
length because identifying the optimal alignment heuristic 
methods rather than global optimization is extremely 
computationally expensive. 

Dynamic programming and computational complexity 

A direct method for producing an MSA uses dynamic 
programming technique to identify the globally optimal 
alignment solution. A gap penalty and a substitution matrix 
of amino acids' chemical properties and the evolutionary 
probability based on the similarity of the amino acid or the 
possibility of assigning a score to each possible alignment 
couple: For protein, this method usually involves two sets 
of parameters change. For nucleotide sequences a similar 
gap penalty is used, but a very simple substitution matrix, 
which corresponds only and is incompatible, is typical. 
Substitution matrix score in either all positive or a mixture 
of positive and negative in the case of a global alignment is, 
however, should be both positive and negative in the case of 
a local alignment. Commonly used to measure 
computational complexity with big O notation expresses a 
naive MSA O (LengthNseqs) takes time to produce. N 
sequences for global optimal way is shown to be an NP- 
complete problem are to find. In 1989, Carrillo- based on 
Lipman algorithm, Altschul is a practical method that uses 
pairwise alignments to constrain the n-dimensional search 
space presented. In this approach pair wise dynamic 
programming alignments are performed on each pair of 
sequences in the query set, and only the alignment of the N- 
dimensional space near the intersection en route alignment 
is searched for. The MSA program in alignment each 
position (the so-called sum pair score) the sum of all of the 
pairs of characters and multiple sequence alignment, 
optimize building has been implemented in a software 
program emented in a software program for constructing 
multiple sequence alignments. 

H. ITERATIVE METHODS 

A set of methods to produce MSAs while reducing the 
errors inherent in progressive methods as "walk" are 
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classified because they work similarly to progressive 
methods but repeatedly the initial scenes as well as to 
organize new scenes add to the growing MSA. is not 
considered further. This approximation improves efficiency 
at the expense of accuracy. Easily applied to a variety of 
different ways to walk and is made available in software 
packages; Review and comparison is useful, but typically a 
"best" technique should avoid choosing. Software package 
PRRN / PRRP its MSA alignment score to optimize the use 
of a hill climbing algorithm and iteratively both alignment 
weights and locally different or "incomplete" the growing 
areas of MSA. When refining an alignment previously 
constructed by a faster method performs best PRRP. 

Another running program, DIALIGN, without introducing a 
gap penalty Berry sub-segments or sequence to focus 
narrowly on the local alignment takes an unusual approach. 
The alignment of individual motifs, a dot matrix plot in a 
pairwise alignment similar to that achieved with a matrix 
representation. An alternative way to fast local alignments 
as anchor points or for a slower global alignment process 
"seed" that uses the chaos / DIALIGN suite is implemented. 
A third popular iteration method based muscle (Log 
expectation multiple sequence alignment), a more accurate 
distance measure to assess the association of two scenes 
with the progressive improvement of the methods cited. The 
distance measure is updated between iteration stages 
(although, in its original form, depending on the muscle that 
was able refinement contained only 2-3 iterations). 
Consensus methods 

The consensus sequences of the same set of methods have 
many different alignments try to find the optimal multiple 
sequence alignment. There are two commonly used 
methods of consensus, M-Coffee and are MergeAlign.M- 
coffee from seven different ways to generate multiple 
sequence alignment alignment consensus uses. MergeAlign 
sequence development or various models of multiple 
sequence alignments generated using different methods of 
input from any number of alignment alignment consensus is 
capable of generating. MergeAlign default options for 
developing the protein sequence alignment generated using 
91 different models using an alignment consensus 
estimates. 

Hidden Markov models 

Hidden Markov models are probabilistic models that gap, 
can assign likelihoods to all possible combinations of 
matches and mismatches to determine the most likely MSA 
or set of possible MSAs are. HMMs can produce both 
global and local alignment. Although HMM-based methods 
have been developed relatively recently, they specifically 


for sequences that contain overlapping areas offer 
significant improvements in computational speed typical 
HMM-based methods directed acyclic MSA possible entries 
in the columns of an MSA, which consists of a series of 
nodes representing a partial order graph, known as the 
graph representing it as a form of work, in the next column 
possible characters alignment. In terms of a typical hidden 
Markov model, the observed state individual alignment 
columns and the "hidden" states represent the presumed 
ancestral sequence from which the sequences in the query 
set are hypothesized to have landed. An efficient search 
variant of the dynamic programming method, Viterbi 
algorithm is known as, generally subsequent query set to 
produce a new MSA to the next scene in the growing line is 
used for MSA .Because the alignment of prior sequences is 
updated at each new addition to the scene is different from 
progressive alignment methods. However, like progressive 
methods, this technology sets the order in which the query 
sequences are integrated into alignment, especially when 
the sequences are distantly related can be affected. 

Genetic algorithms and simulated annealing 
Standard optimization techniques in computer science - 
both of which were inspired by, but directly not reproduce 
the physical processes - even more efficiently produce 
quality MSAs have been used in an attempt to. One such 
technique, genetic algorithms, an attempt is roughly 
estimated that the evolutionary process has lead to 
divergence in the query set to emulate the MSA has been 
used for production. Method possible MSAs into fragments 
and repeatedly breaking a series of different positions by 
rearranging those fragments with the introduction of 
interval functions. A general objective function is optimized 
during the simulation, most generally, "added amount" 
maximum function introduced in dynamic programming- 
based MSA methods. A technique for protein sequences 
software Saga (by genetic sequence alignment algorithm) 
and has been applied to its counterpart in the RNA is called 
passion. 

ATATATAT - 
III III I 
- TATATATA 

Alignment of ATATATAT against TATATATA. 

The technique of simulated annealing is another method by 
which an existing MSA produced by an input alignment 
already occupies space better than the alignment designed 
to find areas is refined by a series of rearrangements. Like 
the genetic algorithm method, simulated annealing sum- 
Of-pairs function maximizes an objective function. A 
simulated annealing symbolic "C" factor that the rate at 
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which rearrangements proceed and determine the likelihood 
of each rearrangement uses; to find out. This approach 
program MSASA (simulated annealing multiple sequence 
alignment) has been implemented in. 

Non-Coding Multiple Sequence Alignment 
Non-coding DNA regions, especially TFBSs, but not 
necessarily more protected and are evolutionarily related, 
and may be associated with a non-common ancestor. Thus, 
the protein-coding regions of DNA sequences and the 
assumptions used in line with those that hold for TFBS 
sequences are inherently different have little meaning for 
the TFBS sequences. This is especially important when the 
same TFBS monitoring model to predict the unknown 
places known to be trying to align the sequences TFBS. 
Therefore, multiple sequence alignment methods and 
working hypothesis underlying evolutionary basis neighbor 
thermodynamic information binding site, Edna's lowest 
thermodynamic binding sites for the alignment of the line 
for the protection of exclusivity to include use as operators 
adjust published need to. 

Alignment visualization and quality control 
Necessary use of heuristics for multiple alignment means an 
arbitrary set of proteins, there is always a good chance that 
an alignment will contain errors is. For example, evaluation 
of programs using key benchmark alignment BAliBase 
coalition found that at least 24% of all pairs of amino acids 
were aligned incorrectly. As the number of sequence 
divergence increases and many more errors because of the 
nature simply will MSA heuristic algorithms. Multiple 
sequence alignment audience visually, often two or more 
views to be reviewed and annotated functional sites 
inspected by the quality of the alignment enable alignment. 
Many also be edited in order to enable an optimal alignment 
'curated' alignment suitable for use in phylogenetic analysis 
or comparative modeling to achieve these (usually minor) to 
correct errors. 

Use in phylogenetics 

Multiple sequence alignments can be used to create a 
phylogenetic tree. This is made possible by two factors. The 
first is known as the functional domain that annotated 
sequences in non-annotated sequences can be used for 
alignment. The other is known to be functionally important 
protected areas can be found. This makes it possible to 
analyze multiple sequence alignment and symmetry through 
the evolutionary relationships between the sequences to be 
used to find. Point mutations and insertion or deletion 
events (called indels) can be detected. 


Multiple sequence alignment such binding sites, active sites, 
or sites similar to other important tasks, such as by applying 
protected domain to identify functionally important sites 
can be used. When multiple sequence alignment, looking at 
different aspects of the scenery when the scenery than 
useful. Identifying these aspects, equality, and compliance 
are included. Means of identification of the remains that 
have similar views on their respective positions. On the 
other hand, the similarity with the sequences being 
compared quantitatively similar to what remains. For 
example, in the case of nucleotide sequences, pyrimidines 
are considered similar to each other, as are purines. 
Eventually equality, in conformity leads to scenes that are 
more similar, they are close to being homologous. This 
similarity in views can go to for help finding the common 
ancestry. 

III. CONCLUSION 

Multiple Sequence Alignment is an extension of Junk 
alignment to include more than two scenes at one time. 
Many of our alignment methods try to align all the 
sequences in a given query set. Efficient fitness value 
function, crossover and mutation strategy is the result of 
work. After all, it is trying that our methods will be greatly 
contributed to the efficient solution of many sequence 
alignment problems. 
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